DOCUHEHT 8ESUHE 



ED 050 741 



LI 002 800 



AUTHOR 

TITLE 

INSTITUTION 
PUB DATE 
NOTE 



EDRS PRICE 
DESCRIPTORS 



IDENTIFIERS 



Herling, John P. ; Lazorick, Gerald J. 

Proposal for a Current Awareness Service for the 
Literature of Library and Information Science. 

State Univ-, of New York, Albany. 

68 

27p. ; Paper prepared tor the Conference on the 
Bibliographic Control of Library Science Literature, 
State University of New York at Albany, April 19-20, 
196 8 

EDRS Price MF-$0.65 HC-S3.29 

♦C ommunica tion (Thought Transfer) , Computers,-,. 
Conferences, Government Publications, Information 
Needs, ♦ Information Science, ♦Library Science, 
♦Permuted Indexes, Scholarly Journals, Vocabulary 
Development 

Keyword Out of Context, ♦Library Science Literature, 
Machine Readable Data 



ABSTRACT 



To meet the need for quick access to the literature, 
and to obtain more effective communication among librarians, 
information scientists, mass communication specialists, operations 
analysts, and behavioral scientists, the authors propose a KWOC 
(Key-Word Out-of-Context) Index using existing machine-readable 
bibliographical data, such as supplied by the Institute of Scientific 
Information and Pandex, supplemented by in-house conversion of core 
journals and U.S. Government Research Reports. Some 340,000 items, 
(annually) would be matched against a set of authors 1 names, journal 
titles, and index terms or groups of terms, the letter weighted to 
eliminate false drops. Preliminary computer output would be manually 
screened, then matched against an open vocabulary to provide the 
output in the form of a monthly printed bulletin arranged by subject, 
with an author index. (Related documents are LI 002 796 - 002 799, 
and LI 002 801 - 002 807). (Author) 
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It is an ironic fact that control of the literature of biblio- 
graphical control - the literature of iibrarianship - the 
literature of information science and technology - is inadequate. 
Librarians who frequently deplore the inadequacies of abstract- 
ing and indexing services in most disciplines, have themselves 
produced no prompt , comprehensive services for their own 
discipline. Although there seems to be a growing awareness of 
the problem ; it is only limited, for one does not find im- 
proved methods of control of library literature among the forty 
four major goals for action of the American Library Association 
in 1967 (1) , nor in the recent survey in one of the leading 

library journals of trends in bibliography in twenty five areas 
of knowledge, is there a paper devoted to trends in the bibli- 
ography of Iibrarianship (2) . 

There are important and useful bibliographic services in library 
and information science. H. Allan Whately (3) identified twenty 
one. We do not denigrate such services as Library Literature 
and Library Science Abstracts . More recent additions to the 
bibliography of this field Documentation Abstracts and the 
Annual Review of Information Science and Technology are valu- 
able contributions to the control of the literature. However, 
most of us would agree that none of the currently published 
abstracting and indexing services is doing a good job in respect 
to coverage and promptness. Alan Gilchrist, who checked a 
bibliography on the evaluation of information systems against 
six services, found that the best coverage (68%) was achieved 
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by American Documentatio n (in its "Literature Notes , 11 



now 



Documentation Abstracts ) and by Library Literature (58%). The 
time lag between publication of originals and publication of 
abstracts averaged 213 days in Library Literature and 281 days 
in American Documentation (4). Whatley (5) comments on the 
failure of the services to cover the report literature and 
cites the editors' estimates of a three to six month time lag. 
We do not propose to solve all the problems of bibliographical 
control of the literature of library and information science and 
technology , nor do we believe there is "one best way". We agree 
with Phyllis Richmond (6) that, "Rather than trying to achieve 
the impossibility of being all things to all men, which i.s again 
giving at least lip-service to the principle of "the one best 
way", it is more prudent to have several systems with differing 
objectives and differing levels of complexity or detail, as 
well as varying design, composition and operation, and to use 
them in conjunction with each other, both directly in some cases 
and indirectly in others - all in a shotgun rather than a rifle 
approach. " 

The system we propose is a computer-based current awareness 
service for the literature of library and information science 
and technology. No one can deny that we are in a period of ex- 
plosive developments in librarianship when quick access to the 
latest results of experimentation and research and to the latest 
information on new methods and new equipment for the acquisition, 
recording and transfer of information is of paramount importance. 
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The design of our system and our approach to current awareness 
will be discussed later. We should like to emphasize, here, our 
conviction of the need for such a service. 'Whatley's survey of 
the purposes for which librarians and library school staff and 
students used present abstracting and indexing services indicated 
that the primary purpose was current awareness (7). Although this 
survey employed only the questionnaire technique and was based on 
a relatively small sample, it is probably the most extensive that 
has been made in the field of library studies. Previously pub- 
lished studies of librarians as users of library literature are 
scarcely worth citing. But this is not unique to librar ianship 
for user studies in general (most of these are in the scientific 
information field) are of questionable value (8). Saul Herner 
(9) at the conclusion of an analysis of the crudeness of user 
studies makes the point that though there are signs of convergence, 
there are still two different groups of people working in the 
information fields [_ 1 the working librarians and information 
specialists and 2 the mass communication specialists, oper- 
ation analysts, and behavioral scientists. Only when we have 
obtained more effective communication within the field, he says, 
can we realistically aspire to understand and interpret the 
communication and information gathering habits of its clientele. 
Current awareness tools, he adds, are one of the means to obtain 
more effective communication within the field. 

THE APPROACH TO CURRENT AWARENESS 
We will design the system to keep a library or information 
scientist abreast of what is happening in his field of interest, 
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and as a service to organizations trying to remain in the fore- 



front of this rapidly expanding field. The system is based on 



the premise that information is most valuable if it is supplied 
and digested as soon as possible after publication. In this way, 
the scientist or practitioner can more quickly transfer informa- 
tion into new, improved or expanded systems and services. 

To accomplish the current awareness objective, the system supply- 
ing the information must be fast . It should require as little of 
the user’s time as possible . It should be e conomical - the cost 
to benefit ratio must be as low as possible,, It should have high 
precision - the information the man receives should he pertinent 
to his interests. He should have to scan as little irrelevant 
information as possible. Our system will computer scan each of 
the titles and authors of app: ‘ ‘ " r 



the government research report literature available from the 
Clearinghouse for Federal Scientific and Technical Information 
and patent information which may soon be available in machine 
readable form. The data bank will also include information about 
new scientific and technical books published in the United States. 
The system identifies published items which should be of interest 
to the individual. By pointing a man toward information which he 
does not regularly see, it supplements, but is not a substitute 
for, the man's own attempts to keep abreast of the expanding 
technologies .. 

The author-title information is indexed, using a modified version 
of Luhn 1 s derivative indexing methods o Thus the advantages of 



appearing in more than 2,000 




this type of permuted indexing and the outgrowth of his method - 
KWIC (Keyword in Context) and KWOC (Keyword out of Context) - 
are inherent in our method of computer scanning. Tukey (10) 
stated that the permuted index was invented as a means of ade- 
quately controlling (essentially of indexing) the literature 
without extensive intellectual effort, and thus eliminating 
indexing delays. Cleverdon (11) commented that the great merit 
of this particular method ... is that it enables information 
concerning new articles to be made available much more quickly 
than if there were the inevitable delays of human abstracting 
and indexing. Janaske (12) indicates that - though there are 
disadvantages which he points out - perhaps the greatest advan- 
tage is the timeliness and the speed with which permuted title 
indexes can be prepared. Skaggs and Spangler (13) tested a 
permuted indexing system and stated, "The most obvious advan- 
tage of permuted indexing by computer is speed. In a test of 
one permuted indexing system, input of 3,000 cards containing 
titles and running text produced a permuted significant word 
index of 12,190 index entry lines, with approximately 85 
minutes of computer time required for the permuting and sort 
operation." Kennedy (14) states, "that the use of the author’s 
own terms ... the alive currency of new ideas «, . . rather 

than the considered reshapings of the indexing system may often 
be of advantage. The automatic generation as .index entries of 
all the separate words in multi-term concepts is definitely so. 
Access is direct, under any one of the component terms, in the 
unrestricted manner of Uniterm indexing and context minimizes 
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false drops; the author has supplied the term coordination." 
Stevens (15) points out that, in general, even among enthusiasts 
of KWIC , there is more agreement as to the values of the tech- 
nique as a device for current awareness scanning and as a 
dissemination index than its use for more extensive searching. 

She indicates that, in fact, it was primarily as a dissemination 
index that Luhn first proposed the KWIC technique. Ruhl (16) 
found that between 50 and 90 percent of author prepared titles 
(the variation depending on subject field and other circumstances) 
did fully reflect the index terms assigned to these documents by 
human indexers. Stevens (17) also states that, "Lane and White 
and Walsh have also made studies directly related to th# question 
of KWIC index effectiveness." The latter two investigators 
report only 52 percent retrieval effectiveness for a permuted 
title index to the Abstracts of Computer Literature - 1962 which 
they attribute to the changing terminology in the still new field 
of computer technology. Lane (18) made counts of titles that 
would be "acceptable" and those that would not for a KWIC index 
for 50 titles drawn from each of 10 published indexes. He con- 
cluded that, if there were judicious pre-editing, technical 
articles in the technical subject indexes could be adequately 
covered, and papers in the field of law, business and humanities 
somewhat less satisfactorily so, but that for material indexed 
in the Reader’s Guide to Periodical Literature , the KWIC tech- 
nique would fail 58 percent of the time. Montgomery and Swanson 
(19) have studied the adequacy of "machine-like indexing by 
people." Montgomery and Swanson took as their test corpus the 



September, 1960 issue of Index Medicus and found that for 4,770 



items, 85.8 percent contained either the word itself or a synonym 
for the subject heading assigned, slightly over 11 percent did 
not, and in the remaining cases the investigator could not clearly 
decide. The National Science Foundation's CR&D Report No. 11 (20) 

cites a study of the Index to Legal Periodicals. The report 
states, "Interpretation of data revealed, among other things, that 
64.4 percent of the title entries contained as keywords one or 
more of the ILP subject heading words under which they were in- 
dexed and 25.1 percent contained logical equivalents . The re- 
maining 10.5 percent of the title entries had nondescriptive 
titles . " 

Many individuals compare computer generated indexes with human 
indexing and comment about the superior quality of huma., index- 
ing. Researchers have studied this and also the speed of human 
indexers. Cleverdon (21) states that few reliable figures have 
been given for current practices, although a particularly high 
figure is the 1 1/2 hours average quoted for indexing reports 
for the catalogue of aerodynamic data prepared by the National 
Fluchtvaart Laboratories in Holland. Further, he points out 
that it appears from personal discussions that 20 minutes per 
report for a general collection of technical reports is the top 
limit, and this has been taken as the maximum indexing time to 
be used in the project. 

Black (22) states, "that there have been enough experiments to 
indicate that there is no consistency, or very little, between 
one indexing performance by a given individual and another in- 
dexing performance at a later date, by the same individual. 



The same inconsistency has been discovered among different in- 
dividuals all indexing the same documents." Thus, there is 
neither inter-indexer nor intra-indexer consistency in a system 
that depends on human performance. Stevens (23) states, "There 
can be little doubt that quality and consistency of most human 
indexing, practically available today, is not good. Much of it, 
because of time and other pressures, is either directly a word 
extraction process, or it is inconsistent in assignment of many 
relevant descriptors and subject category labels. On the other 
hand, today 1 s indexing, whether accomplished by man or machine, 
is probably no better or no worse than any other classif icatory 
or indexing procedures. The only excuse, therefore, for choice 
between man and machine is the cost/benefit ratio which is re- 
lated on the one hand to specific operational considerations 
and on the other to the question of whether or not various users 
would agree with the machine as much as they agree with each 
other." Blpck (24) writes, "that it has been estimated that' 
the efficiency of KWIC indexing is about 76 percent compared 
with about 82 percent for the conventional indexing or class- 
ification . 11 

Perhaps the most significant commeni of all was made by 
Wyllys (25) : 

"It is well-known that the current methods of 
producing, through human efforts, condensed 
representations of documents are already hope- 
lessly inadequate to cope with the present 
volume of scientific and technical literature. 

Many papers are never indexed or abstracted 



at all, and even in the cases of those that 
are indexed or abstracted, the indexes and 
abstracts do not become available until six 
months to two years after the publication of 
the paper. " 

THE INPUT TO THE SYSTEM 

As a data_ base we will use machine readable tapes, such as those 
presently available from Pandex and the Institute of Scientific 
Information, which cover over 2,000 scientific and engineering 
journals including the following important titles in library and 
information science : 

Aslib Proceedings 
American Documentation 

Bulletin of the Medical Library Association 
Information Storage and Retrieval 
Journal of Chemical Documentation 
Journal of Documentation 

Library Resources and Technical Services 
Methods of Information in Medicine 
Nachrichten fur Dokumentation 
Technical Information Center Administration 
Other journals of related interest, available on these tapes ares 
American Behavioral Scientist 

Association for Computing Machinery, Communications 
Computer Bulletin 
Computer Design 
Computing 
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Data Processing 
Datamation 

IBM Journal of Research and Development 

IEEE Spectrum 

IEEE Transactions 

Operations Research 

Science 

To supplement these data, machine readable information concern- 
ing all current technical reports available from the Clearing- 
house for Federal Scientific and Technical Information as found 
in the publication, U.S. Government Research and Development 
Reports is already available at the Technical Information 
Dissemination Bureau, We will ourselves convert to machine 
readable form the bibliographic information for all substantive 
articles appearing in the following journals: 

ALA Bulletin 

Aktulne Problemy Informacji i Dokumentacji 
(Warsaw) 

Annals of Library Science and Documentation 

Association Internationale les Documentalistes 
Bulletin 

Atlantic Provinces Library Association Bulletin 
Bibliography, Documentation Terminology 
Bulletin des Bibliotheques de France 
Cahiers de la Documentation 
Canadian Library 
College and Research Libraries 
D. C. Libraries 

11 
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Drexel Library Quarterly 
Herald of Library Science 
Illinois Libraries 
Indexer 

Indian Association of Special Libraries and 
Informat n Centres, Bulletin 

Journal of education for Librar ianship 

Journal of Library History 

Knihovnik (Prague) 

Law Library Journal 

Library Association Record 

Library Journal 

Library Quarterly 

Library Review 

Library Science with a Slant to Documentation 
Library Trends 
Library World 
Libri 

Metodica A Tehnika Informed (Prague) 

Music Library Association Notes 
National Micro-News 

Nauchno-Technicheskaya Inf ormatsiya 
New Zealand Libraries 
North Carolina Libraries 
Ohio Library Association Bulletin 
Ontario Library Review 
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PNLA Quarterly 

Pennsylvania Library Association Bulletin 
Problemy Peredachi Informatsii 
Scientific Information Notes 
Southeastern Libraries 
South African Libraries 
Special Libraries 
Technicheski Biblioteki SSSR 
Tennika Knihovna (Prague) 

Tidskrift for Dokumentation (Stockholm) 

Tijdschrift voor Efficiente e Documentatie 
Tudomanyos es Muszaki Tajekoztatas 
UNESCO Bulletin for Libraries 

Verband der Bibliotheken des Landes Nordrheiitf - 
Westfalen , Mitteilungsblatt 

Wilson Library Bulletin 

Zeitschrift fur Bibliothekswesen 

In addition , by means of a computer-produced mailing , we will 

solicit from individuals and organizations engaged in research 

and development in library and information science, reports cf 

researc in progress, new technical reports, and preprints of 

papers, thus augmenting the informal channels of communication 

that play an important part in the transfer of information in 

all fields (26) „ We shall also publish advertisements of the 

service in selected journals, in which we will request reports 

and papers to be submitted for indexing • 6,000 new book titles 

(yearly) are also already available in machine readable form. 

13 
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The information for each item in the bank consists of, - at 
least - authors, title, journal abbreviation, volume, page 
numbers, and a unique identifying number. Non descriptive 
titles have subtitles appended to them (if a subtitle is used) 
or the first portion of an abstract (up to 174 characters) if 
an abstract is available. Many of the journal article titles 
are enriched with index terms. All report titles are enriched 
with index terms. Report information consists of ciuthors , title, 
index terms, report number (AD, PB , etc.) and corporate author. 
Contract numbers are also given. Each book title is enriched 
with about ten index terms. 

Thus, the base we will search annually will total an estimated 
341,500 articles, documents, and book titles comprising the 



following : 

1) From journals available on tape 300,000 

2) Clearinghouse Reports Converted to tape 30,000 

3) Library journals converted to tape 4,500 

4) Contributed papers and reports 1,000 

5) New book titles 6,000 



341,500 

The estimated annual pertinent items of information totals 
7225 as follows: 



From 


i. 


above 


1,200 


From 


2, 


above 


500 


From 


3, 


above 


4,500 


From 


4, 


above 


1,000 


From 


5, 


above 


\ 2 5 



7,225 
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We would , in addition, investigate other sources of machine- 
readable bibliographical data produced by documentation centers, 
particularly in Western and Eastern Europe, with a view to 
cooperative exchange. 

THE OUTPUT OF THE SYSTEM 

The output of the proposed system will be the following monthly 
computer produced indexes: 

1) A list of articles and reports arranged 
alphabetically by all autnors. 

2) A keyword out of context (KWOC) index. 

A sample format from another current awareness system which is 
now operational at the Technical Information Dissemination 
Bureau is shown in Figure 1. The lists will be reduced to 
8 1/2" x 11" size and printed using an economically appro- 
priate process. The exact format of the computer output in 
the keyword out of context list will probably be ordered by 
first author in order to facilitate searching the index using 
groups of keywords. 

It is estimated that the monthly output will contain approx- 
imately six hundred entries. These will appear an average of 
two times in the author list and five times in the KWOC list. 
Thus, we anticipate that the monthly list will be printed on 
approximately 200 pages (100 leaves) . 
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Figure 1 
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THE SYSTEM. 



The system will be composed of two phases - a retrieval phase 
and a dissemination phase. The edited output from the Retrieval 
Phase will form the input for the Dissemination Phase. Each 
phase will now be described in turn. 

Retrieval Phase 

In this phase, a set of authors names, journal titles and index 
terms (or groups of terms) from an open vocabulary will be 
matched monthly with machine readable information added to the 
data bank during the past month. The search will be performed 
using word stems. In this manner, a word stem such as "Librar" 
will match Library, Libraries,- Librarian, Librarians, Librarian- 
ship, etc. Each term or term group will be given a weight between 
0 and 1.0. A match will always occur when an authors name 
matches or when a journal title matches. (An eleven digit 
alphameric code is used for the journal title) . Terms or 
term groups will be of two types - primary or secondary. Primary 
terms which match will always generate a "hit". Secondary terms 
will not generate a "hit" unless they occur in combination with 
another secondary term. The combining of only secondary terms 
can generate a "false drop.” This can be controlled by judi- 
cious term weighting. The index terms or term groups can be 
assigned weights between 0.0 and 1.0. Each term weight may be 
considered to be a probability. That is, an item containing 
the term which has weight, say .3, has probability . 3 of being 
a pertinent item. The weights of all terms or term groups are 
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combined using the equation for the probability of the union of 
two events. Thus if 

T1 = Weight of Term 1 
T2 = Weight of Term -2 
The weight for the match is 

T = Tl + T2 - (Tl) (T2) 

The value of T is passed against a threshold value and 

If T ^ TV Match 

T^. TV No Match 

Some examples of terms in the vocabulary are : 

Index 

Cost 

Effectiveness 

Library 

Some examples of term groups in the vocabulary are: 

Information Retrieval 
Information Storage 
Information Dissemination 

The exact term weights which are assigned depend on the search 
strategy which is used. The search strategy selected is not 
independent of the system!. Using the terms above, let's say 
with a threshold value of .5, we could weight all 3 of the 
latter term groups .5. Each then would generate a hit. Alter- 
nately, one could weight "Information" at say .3 and each of the 
terms "Retrieval", "Storage" and "Dissemination" at .3. Either 
method would retrieve an article which had the term group "In- 
formation Retrieval" in the title. The one strategy however 



would result in a total significance value of .5 whereas the 
other would result in a significance value of .51 (.3 + .3 - .09) 
However, the one method may result in considerably more computer 
time and "noise" than the other because in the former case the 
word group contains the term "storage" in a group whereas in 
the latter case the term "storage" occurs alone. Even though 
the term weight is low and would not by itself generate a "hit", 
it will generate a "hit" when combined with another low weight 
term and thus cause system noise. One is then faced with de- 
riving a lengthy open vocabulary of highly specific term groups 
or a shorter vocabulary of individual terms which will generate 
more system noise. In the former case, computer time increases 
as the number of term groups increases. In the latter case, 
computer time increases because the number of hits increases. 
Further, the former case will result in higher precision and 
lower recall than the latter. The authors favor higher recall 
and lower precision since it is proposed that all hits will be 
humanly edited prior to input into the dissemination phase. 
Information about items whose total significance value is 
greater than the threshold value will be output on a two part 
card (Figure 2) . Also, as output, there will be a machine 
readable record of the "hit" on magnetic tapes. These will 
form the input for the dissemination phase. 

An editor will then scan each of the cards and determine which 
will be of interest or of peripheral interest to the field. He 
will remove the right hand portion of the pertinent cards. 
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Figure 2 



Sample Cards 
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These selected "hits” (each right hand card is prepunched with 
an identification number) together with the tape output form 
the input to the dissemination system. 

We will use a dynamic open vocabulary system in the sense that 
terms and term groups will continuously be added. The terms 
will be added when they co-occur (with high frequency) in "hit" 
items with terms already in the open vocabulary. The terms 
will be selected by humanly scanning a computer produced word 
frequency ordered list of terms in "hit" items which are not 
already in the vocabulary* In this way, new terminology will 
always be added as it is introduced in the field. The open 
vocabulary will grow and the word frequency list will diminish. 
Prior to running this retrieval system, all data bank items 
will be passed against a stop list of high frequency common 
terms. The remaining terms will be alphabetically sorted as 
will the open vocabulary index terms. A terra by term match 
will then take place. Common nondescriptive term^ from the 
term frequency list previously described will be entered into 
the stop list. Thus, the stop list will grow in size. (A page 
from a sample stop list is shown in Figure 3) . 

The Dissemination Phase 

The Dissemination Phase will use as input the selected cards 
and magnetic tape output from the retrieval phase as well as 
the list of terms which will be used for the KWOC. The items 
selected by the editor will not be indexed by all significant 
words in the title. Each title will be passed against a broad 
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Figure 3 
Stop List Page 
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stop list. As intermediate putput, all terms which are not in 
the stop list will be output in word frequency order. The editor 
will select those additional words which are to be stopped. 

These will be added to the stop list. The items will then be 
again passed against the expanded stop list. Each item will 
then be sorted by author and an index of all authors "*;ill be 
generated. The items will then be sorted on terms not in the 
stop list. A KWOC index will then be generated. Each item in 
each index will contain all authors, the title, and an appro- 
priate bibliographical citation. Index terms which enrich the 
titles for retrieval purposes will be used as non stop list 
terms but will not be listed with the entry. Thus, one is likely 
to find a title such as IR on a Budget which may be enriched 
with a term such as "Information Retrieval" both under "Infor- 
mation" and under "Retrieval". An alternate sample KWOC index 
page is shown in Figure 4. Experiments are currently being 
conducted with various reduction ratios and formatting. 

The system as proposed has all the advantages of speed and 
timeliness. It also has many of the deficiencies typical of 
such computer systems. We have attempted to reduce these by 
a moderate amount of human intervention. 

SUMMARY AND CONCLUSION 

The system which we propose for the literature of library and 
information science is a real system. It has been completely 
tested and is operational (for other interest fields) at the 
Technical Information Dissemination Bureau of the State 
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Figure 4 

Sample KWOC Index 
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University of New York at Buffalo. Although we believe that a 
current awareness service for library and information science is 
needed and, as we have pointed out in the Introduction, many 
authors have reinforced our belief, we feel it would be desirable 
to test the hypothesis that current awareness for our field is 
needed and wanted. For this reason, we are preparing a proposal 
(to be completed by July, 1968) for a pilot current awareness 
service for our field. This proposal will be submitted to 
appropriate agencies for funding. 
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